CDS

Accession Number TCMCG024C01634
gbkey CDS
Protein Id XP_021970716.1
Location complement(join(79114601..79115197,79116148..79116580,79117024..79117892,79118330..79119115,79119500..79119717,79120368..79120632))
Gene LOC110865701
GeneID 110865701
Organism Helianthus annuus

Protein

Length 1055aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022115024.2
Definition ubiquitin-activating enzyme E1 1 [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category O
Description Belongs to the ubiquitin-activating E1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
ko04147        [VIEW IN KEGG]
KEGG_ko ko:K03178        [VIEW IN KEGG]
EC 6.2.1.45        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04120        [VIEW IN KEGG]
ko05012        [VIEW IN KEGG]
map04120        [VIEW IN KEGG]
map05012        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGATTCCTGAAAAGAGATTAGCTGGAGGAGAGGAAGTAGTAGATGAATCTTTGATCAAGAGGACTAAAAGTGACACAGGGTCTTCTTCAACTGTAGCAACCATGGGTGGGGTAAACAACCCTAACGGTACCACCAATGAGAAATCAGATATTGATGAGGATCTCCATAGCCGACAGCTTGCAGTTTATGGTCGGGAAACTATGAGGCGCTTATTTGCATCCAATGTTCTAGTCTCTGGGATGCAGGGACTTGGTGCTGAAATCGCAAAAAACCTTATCCTTGCTGGTGTCAAGTCTGTGACATTGCATGATGAAGGAACCATCGAGTTGTGGGATTTATCTAGCAATTTCATTTTTACGGAGGATGACATGGGAAAGAACAGAGCTCTTGCTTCTCTCAACAAAATGCAAGAACTGAACAGTTCTGTTGTCATCTCTACATTAACAACTGAGTTAACTACAGAGGACCTCTCTGAGTTTCAGGCTGTAGTATTCACAGATATCAGTTTGGAGAAAGCAATCGAATTCGACAACTTTTGCCATAGACATGAGCCTCCAATCGCTTTTATAAAATCGGAAGTACGTGGACTTTTCGGTAGCGTGTTTTGTGACTTCGGTCCTAAATTCACTGTTTCCGATTTAGACGGAGAGGATCCACACACAGGCATTATTGCATCCATAAGCAATGACAACCCTCCACTTGTCACATTCATTGACGATGAACGGCTAGAGTTTCAAGACGGAGATCTAGTTACATTCTCTGAGGTTCAGGGGATGTCAGAGCTAAATGACGGGAAGCCAAGAAAGGTGATAAATGCAAAACCGTATTCATTCAGCATTGAAGAAGATACCACAAGTTACGGTGAATATAAGAGAGGAGGAATTGTCACTGAGTTAAAGCAACCAAAGGTGTTGCAATTTAAGCCACTGGAAGAAGCACTGAAAGATCCTGGTGAATTTCTTCTCAGTGATTTTTCAAAATTTGACCGCCCTCTGCTCTTGCATCTGTTATTTCAAGCACTGGATAAGTTTGTATCGGAGTTGCGGAGATATCCTGTTGCTGGATCTGAAACAGATGCTCAGAAATTGATTTCTTTGGTTTTTAGTATGCTGAAAGATGGAAATATTGATCGGGTTGATGAGAAAATTGTTAGAAACTTTGCGTTCGGTGCAAGGGCTGTGTTGAACCCCATGGCAGCCATGTTTGGGGGTATTGTTGGACAGGAAGTTGTTAAGGCTTGTTCTGGAAAGTTTCATCCACTACTTCAGTTTTTCTATTTTGACTCTCTCGAGTCTCTTCCTGTTGAGCCCTTGGATCCCGATGACTTGAAGCCCTTGAATAGCCGTTACGATGCTCAGATCTCAGTCTTTGGTGCTAAGCTGCAGAAACAATTAGAGGAAGCTAAAGTTTTCGTTGTAGGATCAGGAGCACTAGGCTGCGAGTTTTTGAAAAATTTAGCCTTAATGGGTGTTTCTTGCGGTAACGGAGGAAAGCTAACAATTACCGATGATGACGTTATTGAAAAAAGCAACCTTAGTAGACAGTTCCTTTTTCGAGATTGTAACATTGGTCAAGCTAAGTCAACCGTTGCAGCAACTGCTGCCACCTTGATAAACCCAAATTTTCATATTGAAGCACTTCAAAATCGTGCCAGCCCCGATACTGAAAATGTGTTTGACGACACTTTCTGGGAGGATCTTAGTGTTGTAATCAACGCTCTTGACAATGTGAATGCAAGGCTCTATATCGATCAGAGGTGTTTGTATTTTCAAAAACCGCTTTTAGAGTCTGGAACTTTAGGTGCCCTGTGTAACACACAGATGGTCATTCCTCACTTGACTGAGAACTATGGTGCCTCCCAGGACCCACAAGAGAAAGGCACGCCAATGTGTACTGTTCATTCTTTCCCGCACAACATTGACCACTGTTTGACTTGGGCTAGGTCAGAGTTTGAAGAATTGCTTGAGAAGACACCAGCTGAAGCAAATGCTTATCTGTTGAACCCAAGTGAATATATTTCTGGTATGGAAAAAGCTGGTGATGCACAGGCAAGGGATAAACTGGAACGTGTGCTTGAATGCCTAGAGACTGAGCGATGTGAATCATTTATAGACTGCATAACTTGGGCCCGCCTAAAGTTTGAAGATTACTTTGCTAACCGTGTGAAACAACTCACCTTCACTTTCCCAGAGGATGCTGTTAACACTAGTGGGGCACCTTTTTGGTCTGCCCCAAAACGTTTCCCGCGCCCCTTGCAGTTTTCTGTTGAAGACCAAAGCCACCTTAACTTTGTTATGGCAGCATCCATTTTACAAGCAGAGACTTATGGCATTCCAATTCCCAAGTGGGTCAAGTCTCATGCAAAGTTTGCTGATGCTGTTAGTGAAGTCGCGGTCCCTGATTTCGAGCCCAAGGAGGGTGTGAAGATTGTAACCGATGATAAAGACACTGATATGTCCACTGTATTCATTGATGACTCTGTTGTAATAAATGAATTGGTCAACAGGTTAAAATTGTGCTACAAGAACCTACCACAAGGCTTCAGGATGAACCCGATTCGGTTTGAAAAGGATGATGACACCAATTATCACATGGACCTAATAGCGGGACTAGCCAATATGAGAGCTCGGAATTACAGCATCCCTGAAGTTGACAAGCTCAAGGCCAAGTTCATTGCTGGCCGGATCATCCCCGCCATAGCAACCACAACCGCCATGGCCACCGGTTTTGTCTGCCTGGAGCTCTACAAGGTCCTGAACGGAGGTCACAAAGTAGAGGACTATAGGAACACCTATGTCAACCTGGCAACCCCTCTATTCTCCATGGCTGAACCTGTTCCGCCAAAGGTGATCAAACACCAGGACCTGAGCTGGACTGTTTGGGACCGTTGGATCCTCAGAGATGACCCGACATTAGGAGAGCTTCTTCAATGGCTAGAAAGTAAAGGACTGAAAGTTTTCATTATATCTTTTGGGAGCTATTTTCTTTATAACAGAATGGGTTCGAGTCATGGAGATAGGATGGATAAGAAGATGGTGAGTTTAGCTAAAGAAGTGGCCAAAGCCGATCTCCCTGCATACAGACGACATTTTGATGTGGTGGTGAACTGTGATGACAGTGATGGCAATAATGTTGATATTCCTCAGATCTCGATATACTTCAGGTAG
Protein:  
MIPEKRLAGGEEVVDESLIKRTKSDTGSSSTVATMGGVNNPNGTTNEKSDIDEDLHSRQLAVYGRETMRRLFASNVLVSGMQGLGAEIAKNLILAGVKSVTLHDEGTIELWDLSSNFIFTEDDMGKNRALASLNKMQELNSSVVISTLTTELTTEDLSEFQAVVFTDISLEKAIEFDNFCHRHEPPIAFIKSEVRGLFGSVFCDFGPKFTVSDLDGEDPHTGIIASISNDNPPLVTFIDDERLEFQDGDLVTFSEVQGMSELNDGKPRKVINAKPYSFSIEEDTTSYGEYKRGGIVTELKQPKVLQFKPLEEALKDPGEFLLSDFSKFDRPLLLHLLFQALDKFVSELRRYPVAGSETDAQKLISLVFSMLKDGNIDRVDEKIVRNFAFGARAVLNPMAAMFGGIVGQEVVKACSGKFHPLLQFFYFDSLESLPVEPLDPDDLKPLNSRYDAQISVFGAKLQKQLEEAKVFVVGSGALGCEFLKNLALMGVSCGNGGKLTITDDDVIEKSNLSRQFLFRDCNIGQAKSTVAATAATLINPNFHIEALQNRASPDTENVFDDTFWEDLSVVINALDNVNARLYIDQRCLYFQKPLLESGTLGALCNTQMVIPHLTENYGASQDPQEKGTPMCTVHSFPHNIDHCLTWARSEFEELLEKTPAEANAYLLNPSEYISGMEKAGDAQARDKLERVLECLETERCESFIDCITWARLKFEDYFANRVKQLTFTFPEDAVNTSGAPFWSAPKRFPRPLQFSVEDQSHLNFVMAASILQAETYGIPIPKWVKSHAKFADAVSEVAVPDFEPKEGVKIVTDDKDTDMSTVFIDDSVVINELVNRLKLCYKNLPQGFRMNPIRFEKDDDTNYHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATTTAMATGFVCLELYKVLNGGHKVEDYRNTYVNLATPLFSMAEPVPPKVIKHQDLSWTVWDRWILRDDPTLGELLQWLESKGLKVFIISFGSYFLYNRMGSSHGDRMDKKMVSLAKEVAKADLPAYRRHFDVVVNCDDSDGNNVDIPQISIYFR